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We address the question of how many maximally entangled photon pairs are needed in order to build up cluster 
states for quantum computing using the toolbox of linear optics. As the needed gates in dual-rail encoding are 
necessarily probabilistic with known optimal success probability, this question amounts to finding the optimal 
strategy for building up cluster states, from the perspective of classical control. We develop a notion of classical 
strategies, and present rigorous statements on the ultimate maximal and minimal use of resources of the globally 
optimal strategy. We find that this strategy - being also the most robust with respect to decoherence - gives rise 
to an advantage of already more than an order of magnitude in the number of maximally entangled pairs when 
building chains with an expected length of L = 40, compared to other legitimate strategies. For two-dimensional 
cluster states, we present a first scheme achieving the optimal quadratic asymptotic scaling. This analysis shows 
that the choice of appropriate classical control leads to a very significant reduction in resource consumption. 

PACS numbers: 03.75.Ss, 03.75.Lm, 03.75.Kk 



To actually experimentally realize a fully-fletched univer- 
sal quantum computer constitutes a tremendous challenge. 
Among the promising candidates for possible architectures 
are the ones entirely relying on optical systems. State manip- 
ulation can then be realized using sources of single photons 
or entangled pairs, arrays of linear optical elements, and pho- 
ton detectors fl 0, II B S 01- Some of the advantages of 
such an approach are obvious: accurate state manipulation is 
available using linear optical elements, and photons are com- 
parably robust with respect to decoherence. In turn, there is 
a price to pay when avoiding the exploitation of any physi- 
cal non-linearities and effectively realizing them via measure- 
ments: due to the small success probability of elementary 
gates 01, a very significant overhead in optical elements 
and additional photons is required to render the overall proto- 
col near-deterministic. 

Consequently, one of the primary goals of theoretical work 
towards the realization of a linear optical quantum computer 
is to find ways to reduce the necessary overhead in resources. 
For the seminal scheme of Ref. 0], this overhead can not be 
reduced by simply building better elementary sign shift gates 
0. Schemes based on the model of one way computation 
point towards a reduction of resource consumption by orders 
of magnitude 0.0], a perspective that has attracted consid- 
erable interest in recent research d [Till . This de- 
velopment reminds of an inverse 'Moore's law' of the known 
minimally required resources for linear optical computing as 
a function of time. The central ingredient to these realizations 
are cluster states 0] or graph states fl2ll which can be built 
up from maximally entangled photon pairs (4-qubit cluster 
states have already been experimentally prepared Il3l0 . Fu- 
sion gates of type-I and II have been applied to the task of cre- 
ating cluster states 0[3l, deriving from parity check gates 
0] and partial Bell projections. However, these gates are in- 
herently probabilistic, in that in each step the experiment can 
either succeed or fail with the outcome being known. 

In fact, it is not difficult to show that the maximal proba- 
bility of success of a quantum gate realizing a fusion of two 
dual-rail encoded linear cluster states is p s = 1 /2, by relating 



this to the optimal success probability of a Bell measurement 
with linear optics dlH, see also Ref. O. When prepar- 
ing linear cluster states from EPR pairs, the only freedom we 
have for improvement is to identify the optimal classical strat- 
egy for fusing cluster state pieces. As the possible patterns 
of failure and success increase exponentially, an overwhelm- 
ing wealth of situations can potentially occur. Deciding how 
to optimally react to any of these situations constitutes a very 
hard problem indeed, but may have tremendous implications 
on the amount of resources needed. A similar situation occurs 
when preparing two-dimensional (2-D) cluster states. 

In this work we will address the latter question; i.e., what 
is the optimal strategy to cope with the probabilistic nature of 
fusion gates in constructing one and two-dimensional cluster 
states? While previous research has more strongly focused on 
saving resources by devising ingenious ways of implementing 
quantum gates, it is found in the present paper that choosing 
an optimal classical control strategy can cut the needed entan- 
glement by further orders of magnitude. In this way, we can 
also bound the resources that any scheme within the above 
mentioned set of rules would require. 

We begin the specific investigation with the one- 
dimensional case. Linear cluster states can be pictured as 
chains of qubits, characterized by their length I given in the 
number of edges. Maximally entangled qubit pairs (EPR 
pairs) correspond to chains with a single edge. By a configu- 
ration we mean a set of chains of specific individual lengths. 
Type-I fusion 0] allows for operations involving end qubits of 
two pieces (lengths l\ and I2), resulting on success (p s = 1/2) 
in a single piece of length l\ + 1% or on failure (pj = 1/2) in 
two pieces of length l\ — 1 and Z2 — 1. The process starts 
with a collection of EPR pairs and ends when only a sin- 
gle piece is left. A strategy decides which chains to fuse 
given a configuration. It is assessed by the expected length 
of the final cluster. The vast majority of strategies allow 
for no simple description and can be specified solely by a 
'lookup table' listing all configurations with the respective 
proposed action. Since the number of configurations scales 
as OiN 1 ' 2 exp(7r(2A^/3) 1 / 2 )) El as a function of the total 
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number of edges N, a single strategy is already an extremely 
complex object and any form of brute force optimization is 
completely out of reach. 

However, there is one simple strategy which might reason- 
ably be conjectured to be optimal. Indeed, we face a proba- 
bilistic process, and we lose entangled resources on average. 
Hence, it seems advantageous to quickly build up long clusters 
by always fusing the largest available pieces together. This 
strategy we call GREED: 

• GREED: Always fuse the largest available pieces. 

In turn, one can also be conservative and always fuse the 
smallest available pieces. This apparently inferior strategy, 
dubbed MODESTY, will not deliver long chains in early steps. 

• MODESTY: Always fuse the smallest available pieces. 

Quite surprisingly, it will turn out that not only is MODESTY 
vastly more effective than GREED, but even extremely close 
to the globally optimal strategy. 

Let us further formalize these notions. A (pure) configura- 
tion consisting of rij pieces of length li, i = 1, c, will be 
denoted as C := {4 j ■ ■ ■ ,ii™ c ^}. The total number of edges 
is given by N{C) := £\ ndi and C N := {C\N{C) < N} 
is the configuration space for N <E IN. A mixed configura- 
tion is a probability distribution p defined on the elements 
of Cn- The expected total length of a mixed configuration 
is (L)(p) := p(C)N(C). Strategies act naturally as 
stochastic matrices 11911 on mixed configurations by acting 
on every pure configuration in its support independently. Re- 
peated application of a strategy will eventually lead to a prob- 
ability distribution over configurations {l^} with only 
a single chain each. The quantity Q(C) := (L)(pf\ na i) is the 
expected yield of C with respect to the given strategy. Of cen- 
tral importance is the quality Q(C) := supQ(C), the best 
possible expected length that can be achieved starting from C 
by means of any strategy. We abbreviate Q({l( N '}) by Q(N) 
12011 . Note that the quantum nature of the cluster states does 
not enter the consideration. 

Observation 1 (Lower bound for globally optimal strategy) 

Starting with N EPR pairs and using type-I fusion gates, the 
globally optimal strategy yields a cluster state of expected 
length 

Q(N) > Q(N ) + a(N - N ) 

for all N > Nq. The constants are No = 92, Q(Nq) = 
16.1061, a = 0.153336 (known as rational numbers k2l\l ). 

For N < 2Nq a desktop computer can symbolically compute 
the performance of MODESTY Q(N) < Q(N). One finds 
that the above relation is valid in this case. For N > 2No 
input pairs we adopt the following strategy: first the input is 
divided into k blocks of length m where No < n., < 2No 
and Modesty is used to convert any such block into a single 
chain. Secondly, the resulting chains are fused together. 

If C is a configuration consisting of only two chains of 
length li > I2 one easily finds that Q(C) ~ h + h — 



2 X^lo 2_i ^ h + h - 2- More generally, it can be 
shown O that Q(N) > Y, l Q( n i) - 2 ( k ~ Now set 
Q := (Q(Nq) — 2) /No. From the computed data we know 
that (Q(rii) — 2)/n,) > a for all i. Imposing without loss of 
generality ri\ = No we see that 

k ~ 

Q(N) > Q(jV ) + Vn, Q(n ' ) ~ 2 

k 

> Q{N ) + a^m = Q(N ) + a(N - No). 

Observation 2 (Upper bound to globally optimal strategy) 

The quality is bounded from above by Q(N) < 7V/5 + 2. 

While the performance of any strategy delivers a lower bound 
for the optimal one, giving an upper bound is considerably 
harder. The following paragraphs expose all the key ideas of a 
rigorous proof (details can be found in Ref. fl7ll ). We proceed 
in three steps. The first step is to realize that, because every 
attempted fusion fails with probability 1/2 and destroys two 
edges in case of failure, the expected number of lost edges 
equals the expected number of fusion attempts T(C) a strategy 
undertakes acting on some configuration C. As the average 
final length Q(C) is nothing other than the initial number of 
edges N(C) minus the expected number of losses, we have 
Q{C) = N{C) - T{C). Hence any lower bound on T will 
supply an upper bound for Q(N). 

Secondly, we pass to a greatly simplified model - dubbed 
razor model - from which we can extract bounds for T. This 
is done by introducing a quite radical new rule: after every 
step all chains will be cut to a maximum length of two. It 
turns out that there exists a strategy in the razor model which 
terminates using fewer fusion attempts on average Tr than 
the optimal strategy for the full model. Intuitively, this is the 
case as the 'cutting operation' increases the probability for 
chains to be completely destroyed due to failed fusions. How- 
ever, making this argument precise is greatly impeded by the 
fact that one needs to compare strategies which are defined 
on different models. Indeed, given the optimal strategy of the 
full setup, there is no direct way of turning it into a strategy 
for the razor model. We solve the problem as follows. Let 
C be a configuration and C the result of removing a single 
edge from one chain in C. In ifTvtl we derive the estimate 
Q(C) > Q(C) > Q(C) - 1. Combining the findings of 
the last paragraph with N(C) = N(C) — 1, we arrive at 
Q(C') > Q(C)-1 N(C)-1-T{C) > N(C)-T(C)-1 
and hence T(C) < T(C). So removing a single edge from a 
chain decreases the expected number of fusion attempts per- 
formed by the optimal strategy. As the passage to the razor 
model can be perceived as a repeated removal of single edges, 
we can use these observations to prove T > Tr. 

In a last step we further simplify the problem in order to 
obtain a lower bound for Tr. A configuration C of the razor 
model is specified by two natural numbers I2) giving the 
number of chains of length 1 and 2, respectively. In each step 
a strategy has three options: try to fuse (a) two short chains; 



3 



(b) two long ones or (c) a long and a short chain. Consider 
the choice (a). In case of failure the chains are destroyed and 
so C h C + af where ap := (—2,0). An analogous rela- 
tions holds for successful fusions where as '■= (—2,1) and 
similar rules can be formulated for options b and c. We are 
thus naturally led to interpret the problem as a random walk 
on a two-dimensional lattice. As initially there are N single- 
edge chains in the configuration, the walk starts at (N, 0). It 
will end when there is no more than one chain left, so at po- 
sitions (1, 0), (0, 1), (0, 0). So how many steps does a proba- 
bilistic process require - on average - to cover that distance? 
If a strategy decides at some point in the walk to choose 
action a, then 'on average' the configuration will move by 
« := (as + o_f)/2 = (—2, 1/2) on the lattice. Denote by (a) 
the expected number of times a given strategy opts for a when 
acting on (N, 0). Define b, (b), c, (c) similarity. From the dis- 
cussion it is intuitive (and can be made precise 0) that any 
strategy fulfills (a)a + (b)b + (c)c < (-2V + 1,1). As the ex- 
pected number of fusion attempts Tr equals (a) + (b) + (c), 
one can obtain a lower bound by solving the linear program: 
minimize Tp subject to the constraints given above. By pass- 
ing to the dual problem l22ll an analytic solution can be found 
which gives rise to the estimate stated in Observation^ 

Observation 3 (Symbolic calculation of optimal length) 

The globally optimal strategy can be computed with an effort 
ofO{\C N \(\og\C N \) 5 ). 

We have implemented a backtracking algorithm which in ef- 
fect recursively computes the quality of all configurations up 
to some arbitrary total length. The results are stored in a look- 
up table which causes memory consumption - rather than time 
- to limit the practical applicability of the program. This ex- 
plains the dominating factor \Cn\ in the estimate of the com- 
putational effort: every con figur ation has to be looked at at 
least once. A closer analysis 1 1 711 reveals the poly-log correc- 
tion. Note that, even though the effort scales exponentially in 
N, the algorithm is vastly more efficient than a naive approach 
which would enumerate all strategies to select the optimal one 
by directly comparing their performances. 

The algorithm has been implemented using the computer 
algebra system Mathematica and employed to derive in closed 
form an optimal strategy for all configurations in C46 ll2lll . A 
desktop computer is capable of performing the derivation in 
a few hours. Starting with {1^}, MODESTY turns out to 
be the optimal strategy for all N < 10. For configurations 
containing more edges, slight deviations from MODESTY can 
be advantageous. However, the difference relative to Q(N) is 
smaller than 1.1 x 10~ 3 for N < 46. 

Observation 4 (Asymptotic performance of Greed) 
Starting with N EPR pairs and fusing them with type-I 
fusion under GREED results in an expected length of 
Q(N) = (2N/ir) 1/2 + 0(1). 

It is interesting to see how MODESTY compares with the 
asymptotic performance of the equally reasonable strategy 
Greed. Starting from {1^}, only pieces of length 1 and 
one single piece of length I > 1 may occur during the fu- 
sion process. Hence, the support of the probability distribu- 
tion is {C = l< m )} : m = 0, 1, I = 2,3, I + m < 
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FIG. 1 : Expected length for the globally optimal strategy, for MOD- 
ESTY (in this plot indistinguishable from the former), a lower bound 
(with No = 46), for GREED, its asymptotic performance, and the 
upper bound, as functions of even TV. 

N} U {l( m ) : m < N}. The implementation of GREED gives 
rise to a Markov chain on this set with a reflecting boundary 
fl^l . From this, one may determine the asymptotic behaviour 
of the expected length using a Gaussian approximation. This 
means the linear chain grows as a square root in the number 
of available pairs N, rather than linearly. 

Observation 5 (Comparison of Greed and the optimal strategy) 

To realize an expected length of 40 in a linear cluster state, 
the resources N required by GREED and the optimal strategy 
already differ by more than an order of magnitude. 

Results for the expected length using symbolic algebraic cal- 
culations are shown in Fig.^ for the strategies MODESTY; for 
the globally optimal strategy, GREED; and the lower bound of 
Observation 1, almost identical with the curve of MODESTY. 
The difference between the performance of MODESTY and 
GREED is enormous: it hence does matter indeed, concerning 
resource consumption, what classical strategy one adopts |24|. 

Recall that the expected length equals the total number of 
edges in the original configuration minus the expected num- 
ber of losses. The latter number, in turn, is proportional to the 
number of fusion attempts on average. Therefore, the opti- 
mum strategy is also the one employing the smallest number 
of fusion steps, and is hence also the most robust with respect 
to decoherence processes associated with these steps. Note 
also that the presented analysis, needless to say, can also be 
applied to other physical architectures where one has to cope 
with a probabilistic character of fusion gates, such as in matter 
qubits coupled via optical systems. 

Observation 6 (Optimal scaling for 2-D cluster states) An 

nxn cluster state can be prepared using a linear cluster state 
of length 0(n 2 ) - employing x measurements and type-ll 
fusion - such that the overall success probability satisfies 
P s (n) — > 1 as n — > 00. 

We now turn to two-dimensional structures, to be built by 
'weaving' cluster chains. Using the type-II fusion gate jj] 
in succession to an x measurement (consuming two edges) 
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FIG. 2: A possible pattern of how to arrange n + 1 linear clusters 
(threads) to weave a carpet of width n. Fusion operations have to be 
applied at the black circles along the long linear cluster state. Arrows 
mark free ends. 



delivers on success (p s = 1/2) a vertex incorporating both 
linear clusters, hence an elementary 2-D structure. In case of 
failure (losing two edges without splitting the original chains) 
the scheme described in Ref. ]4j] can be used for subsequent 
attempts, consuming 3 + 2/ edges with / being the number of 
failures. Obviously, no scheme can result in more economical 
asymptotics than 0(n 2 ) in the use of entangled resources. In 
any preparation scheme, however, overhead has to be taken 
into account to ensure a near-deterministic outcome, as a sin- 
gle failure may endanger the already generated 2-D cluster. 

Finding the overall success probability P s (n) in a closed 
form is impeded by the fact that failures on earlier vertices in- 
fluence the number of resources left and therefore the number 
of possible failures on later vertices. We are able to decouple 
these problems by considering a 'weaving pattern' as depicted 
in Fig. |3 Let us denote with m the overhead in each of the 
horizonal linear cluster states of length I = n + m, and take 
a single linear cluster state of length L = n(l + 1). We will 
show that a choice of n i— > an = m for a > 2 will be an 
appropriate choice for the scaling of the overhead. 

To start with the more formal part, based on the above pre- 
scription, the probability P s (n) of succeeding to prepare an 
n x n cluster state can be written as P s (n) = 7r s (n) n . Here, 



7T s (n) 



1 



> / an 



= 1 - F(n - I, an, 1/2). 



is the success probability of fusing a single chain of length 
to = an into the cluster, with F denoting the standard cumu- 
lative distribution function of the binomial distribution. Since 
2n — 2 < an for all n, we can hence bound ir s (n) from below 
by means of Hoeffding's inquality [23]. This gives rise to the 
lower bound ir s (n) > 1 — exp(— 2(an/2 — n+ l) 2 /(an)). 
As a > 2, one can show that liminfn-xx, 7t s (?t,)™ > 1, and 
hence, \im n ^, oq P s (n) — lirrin^oo ir s (n) n = 1, which is the 
argument to be shown. It is remarkable that for 2 > a > 
1, then lirrin^oo Ps(n) = 0, and the preparation will fail, 
asymptotically even with certainty. This argument proves that 
a 2-D cluster state can indeed be prepared using 0(n 2 ) EPR 
pairs, making use of probabilistic quantum gates. This may 
be considered good news, as it proves that the natural scaling 
in the resources can be met with negligible error. 

In this work, we have addressed the question of how to 
build optical linear and two-dimensional cluster states from 
the perspective of classical strategies. We have introduced 
tools to assess the performance of several protocols, includ- 
ing the globally optimal strategy. Further, we have shown that 
two-dimensional cluster states can be generated with resource 
requirements of 0(n 2 ), which is the most economical scaling. 
It has hence turned out that the mere classical control indeed 
does matter, and that differences in resource requirements of 
orders of magnitude can be expected depending on the chosen 
strategy. The presented techniques may, after all, be expected 
to provide powerful tools to assess and develop techniques for 
building redundancy encoding resource states ffl or to pre- 
pare states rendering linear optical schemes fault tolerant 12511 . 
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